Perceptual audio coding using adaptive pre- and post-filters and lossless compression

نویسندگان

  • Gerald Schuller
  • Bin Yu
  • Dawei Huang
  • Bernd Edler
چکیده

This paper proposes a versatile perceptual audio coding method that achieves high compression ratios and is capable of low encoding/decoding delay. It accommodates a variety of source signals (including both music and speech) with different sampling rates. It is based on separating irrelevance and redundancy reductions into independent functional units. This contrasts traditional audio coding where both are integrated within the same subband decomposition. The separation allows for the independent optimization of the irrelevance and redundancy reduction units. For both reductions, we rely on adaptive filtering and predictive coding as much as possible to minimize the delay. A psycho-acoustically controlled adaptive linear filter is used for the irrelevance reduction, and the redundancy reduction is carried out by a predictive lossless coding scheme, which is termed weighted cascaded least mean squared (WCLMS) method. Experiments are carried out on a database of moderate size which contains mono-signals of different sampling rates and varying nature (music, speech, or mixed). They show that the proposed WCLMS lossless coder outperforms other competing lossless coders in terms of compression ratios and delay, as applied to the pre-filtered signal. Moreover, a subjective listening test of the combined pre-filter/lossless coder and a state-of-the-art perceptual audio coder (PAC) shows that the new method achieves a comparable compression ratio and audio quality with a lower delay.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Lossless and Perceptual Coding of Digital Audio

We have seen rapid progress in high-quality compression of wideband audio signals. Today’s coding algorithms can achieve substantially better compression than was thought possible only a few years ago. In the case of audio coding with its bandwidth of 20 kHz and more, the concept of perceptual coding has paved the way for significant bit rate reductions. However, multiple coding can reveal orig...

متن کامل

Improved Forward-Adaptive Prediction for MPEG-4 Audio Lossless Coding

MPEG-4 Audio Lossless Coding (ALS) is a new addition to the suite of MPEG-4 audio coding standards. The ALS codec is based on forward-adaptive linear prediction, which offers remarkable compression even with low predictor orders. Nevertheless, performance can be significantly improved by using higher predictor orders, more efficient quantization and encoding of the predictor coefficients, and a...

متن کامل

Integer Wavelet Transform Based Lossless Audio Compression

In this paper we propose the use of integer wavelet [2] as a decorrelation stage for adaptive context based lossless audio coding. The original wideband audio signal is first decomposed in wavelet subbands. The resulted coefficients are integer valued and therefore can be transmitted using an adaptive context based method, in a lossless manner, the decoder being able to reconstruct them and aft...

متن کامل

Mpeg­4 Als – the Standard for Lossless Audio Coding

The MPEG-4 Audio Lossless Coding (ALS) standard belongs to the family MPEG-4 audio coding standards. In contrast to lossy codecs such as AAC, which merely strive to preserve the subjective audio quality, lossless coding preserves every single bit of the original audio data. The ALS core codec is based on forward-adaptive linear prediction, which combines remarkable compression with low complexi...

متن کامل

Integer wavelet transforms based lossless audio compression

In this paper we propose the use of integer wavelet [2] as a decorrelation stage for adaptive context based lossless audio coding. The original wideband audio signal is first decomposed in wavelet subbands. The resulted coefficients are integer valued and therefore can be transmitted using an adaptive context based method, in a lossless manner, the decoder being able to reconstruct them and aft...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IEEE Trans. Speech and Audio Processing

دوره 10  شماره 

صفحات  -

تاریخ انتشار 2002